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1. Introduction 


The output file produced by the UNIX® system as assembler and the Id link editor is in a 
format called the Common Object File Format (COFF). Several target processors use this format 
and operating systems besides the UNIX system also use it. Hence the word Common is both 
descriptive and widely recognized as a unique name. Because systems other than the UNIX 
operating system use COFF, the format includes some symbols and fields that appear to be 
extraneous. These items are extraneous for UNIX operating system users, and are included 
only to maintain commonality. 


Currently, this object file format is used for the 
VAX+-11/780 and 11/750 UNIX operating systems. 


The object file supports user-defined sections and contains extensive information for 
symbolic software testing. An object file consists of a file header, optional header 
information, a table of section headers, the data corresponding to the section headers, 
relocation information, line numbers, and, finally, a symbol table. Figure 1 shows the overall 
structure. 


The last three sections (relocation, line numbers, and the symbol table) may be missing if the 
program was linked with the -# option of the UNIX link editor or if the relocation 
information and symbol table were removed by the UNIX strip command. Also, if there 
were no unresolved external references after linking then the relocation information is no 
longer needed and will be absent. 


An object file that contains no errors or unresolved references can be executed on the UNIX 
operating system. 
The Common Object File Format is not only simple enough to be easily incorporated into 


existing projects, but advanced enough to meet the needs of yet unspecified operating 
systems. Some key features are: 


¢ Applications may add system-dependent information to the object file without 
causing access utilities to become obsolete. 


° A wealth of symbolic information is provided for the use of debuggers and other 
applications. 


° Many modifications in object file construction may be made by the user or 
application at compile time. 


2. Definitions 
The object file specification uses the following terms: 


® UNIX is a trademark of Bell Telephone Laboratories, Inc. 
+ VAX is a trademark of the Digital Equipment Corporation. 
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Optional Information 
(UNIX system a.out header) 


Section 1 Header 


Raw Data for Section 1 


Raw Data for Section n 


Relocation Info for Sect. 1 
Relocation Info for Sect. n 
Line Numbers for Sect. 1 
Line Numbers for Sect. n 


SYMBOL TABLE 


Figure 1. Object File Format 


Section A section is the smallest portion of an object file that can be 
relocated and can be treated as one separate, distinct entity. In 
the default case, there are three sections, named .text, data, and 
-bss. Additional sections are used for accommodating multiple 
text or data segments, shared data segments, or user-specified 
sections. 


Physical Address This is the physical location in memory where a section is loaded. 


Virtual Address This is the offset of a section with respect to the beginning of its 
segment or region. All relocatable references in a section assume 
that section occupies the virtual address at execution time. 


3. File Header 


The file header contains the twenty bytes of information described in Table 1. The last two 
bytes are flags that are used by Id. The UNIX Reference manual page for FILEHDR(4) gives 
the exact C language structure for the file header. 


TABLE 1. FILE HEADER CONTENTS 


Byte: Declaration Name Description i 
0-1 unsigned short f magic Magic number as defined by the symbol 
MAGIC in file a.out.h. 


2-3 unsigned short fnscns Number of section headers (equals the 

number of sections) 

Time and date stamp indicating when the file 

was created relative to the number of elapsed 

seconds since 00:00:00 GMT, Jan 1, 1970. 

8-11 long int fsymptr File pointer containing the starting address of 
the symbol table 


12-15 long int fnsyms Number of entries in the symbol table 


16-17. unsigned short fopthdr Number of bytes in the optional header 
18-19 unsigned short f flags Flags, see Table 2. 


long int f timdat 


The size of optional header information, f_opthdr, should be used by all referencing 
programs that need to seek to the beginning of the section header table. The size of the 
UNIX system optional header is currently 28 bytes on the VAX systems, 


3.1 Flags 


The last two bytes of the file header are flags that describe the type of the object file. The 
UNIX operating system version of the COFF has no use for some of these flags, but keeps 
them to maintain commonality. 


TABLE 2 FILE HEADER FLAGS 
Mnemonic Flag Meanings 


F_RELFLG 00001 Relocation information stripped from file 
F_EXEC 00002 File is executable (ie. no unresolved external 
references) 


F LNNO 00004 Line numbers stripped from file 


F SWABD 00100 This file has had its bytes swabbed (i.e., the 
bytes of symbol table name entries have been 
reversed.) 


F_AR32WR 00400 Created on AR32WR machine (e.g. VAX- 
11/780) 
F AR32W 01000 Created on AR32W machine 


F PATCH 02000 Not applicable to UNIX operating system 


The notation "“ARIGWR" in Table 2 signifies the architecture of the host machine where the 
file was created. The AR stands for architecture, the digits give the number of bits per word, 
W signifies left-to-right byte-ordering (most significant bit first), and WR signifies right-to- 
left byte-ordering (least significant bit first). 


4. Optional Header Information 


The template for optional information varies among different systems that use the COFF. 
Applications place all systems-dependent information into this record. General utility 

programs (e.g., the table access library functions, the disassembler, etc.) can be made to work 
properly on any Common object file by seeking past this record using the size of optional 
header information in the file header (bytes 16-17). 


4.1 Standard UNIX System a.out Header 


By default, files produced by the link editor always have a standard UNIX system a.out 
header in the optional] header field. The VAX version of the optional header is 28 bytes long, 


The fields of the optional header 
are described in table 3. 


TABLE 3. OPTIONAL HEADER CONTENTS 


Bytes Declaration Name Description 
0-1 short magic Magic number 


12-15 long int bsize Size of uninitialized data 
20-23 long int dum2 Unused dummy field 
32-35 long int data start Base address of data 


The magic number for the UNIX operating system is a machine-dependent constant that can 
be found in the header file a.out.h(4). 


42 C Data Structure 


The following C language struct declaration is currently used for standard UNIX system a.out 
file header: 


typedef struct aouthdr { 


short magic; {* magic number */ 
short vetamp; 7° version stamp °/ 
long _taize; /* text size in bytes, padded */ 
/° to full word boundary °/ 
jong = dsize; /* initialized data size * / 
long _ibeize; /* uninitialized data size °/ 
#if w3b 
long dumi; /* unused dummy field °/ 
dum2; f° unused dummy field °/ 
#endif 
long entry; /* entry point */ 
long __ text_start; /° base of text for this file °/ 
long  data_start; /* base of data for this file */ 
} AOUTHDR; 


5. Section Headers 


Every object file has a table of section headers to specify the layout of data within the file. 
Each section within an object file also has its own header. 


The section header table consists of one entry for every section in the file. Each entry 
contains the following information about the section: 


TABLE 4 SECTION HEADER CONTENTS 

0-7 char 8_name 8 char null padded section name 
8 -11 long int 8 _paddr Physical address of section 
12-15 _jong int s_vaddr Virtual address of section 


16-19 long int 8 size Section size in bytes 
24-27 long int s_relp File ptr to relocation entries 


32-33 unsigned short s_nreloc Number of relocation entries 
34-35 unsigned short s_ninno Number of line number entries 


36-39 long int flags Flags (see below) 


Section sizes are always padded to a multiple of 4 bytes. 


File pointerc are byte offsets that can be used to directly locate the start of data, relocation, or 
line number entries for the section. They can be readily used with the UNIX system 
function fseek(3S). 


5.1 Flags 


The lower 4 bits of the flag field indicate a section type, defined as follows: 


Mnemonic Flag 
STYP_REG Ox00 regular section 
(allocated, relocated, loaded) 
STYP_DSECT 0x01 dummy section 
(not allocated, relocated, not loaded) 
STYP_NOLOAD 0x02 noload section 
(allocated, relocated, not loaded) 
STYP_GROUP Ox04 grouped section 
(formed from input sections) 
STYP_PAD @x08 padding section 
(not allocated, not relocated, loaded) 
0x10 copy section 
(for a decision function used in updating 
fields; not allocated, not relocated, loaded, 
relocation and line number entries processed 
normally) 


STYP_TEXT 0x20 section contains only executable text 
STYP_DATA 0x40 section contains only initialized data 


STYP_BSS 0x80 = section contains only unintialized data 


The C language data structure that is used to declare section headers can be found on the 
UNIX Reference manual page SCNHDR(4). 


STYP_COPY 


5.2 .bss Section Header 


The one anomaly in the section header table is the entry for uninitialized data in a .bss 
section. A .bss section has a size, symbols that refer to it, and symbols that are defined in it. 
At the same time a .bss section has no relocation entries, no line number entries, and no 
data. Therefore, a .bss section has an entry in the section header table but occupies no space 
elsewhere in the file. In this case, the number of relocation and line number entries, as well 
as all file pointers in a .bas section header, are zero. 


6. Sections 


Figure 1 shows that section headers are followed by the appropriate number of bytes of text 
or data. The raw data for each section begins on a full word boundary in the file. 


Files produced by the UNIX system cc compiler and the es assembler always contain three 
sections, called .text, data, and .bes. The .text section contains the instruction text (i.e., 
executable code), the .data section contains initialized data variables, and the .bss section 
contains uninitialized data variables. 

The link editor "SECTIONS directives” (see Link Editor User's Manual) allow users to 
describe how input sections are to be combined, to direct where to place output sections, and 
to rename output sections. If no SECTIONS directives are given, each input section appears 
in an output section of the same name. For example, if a number of object files from the 
‘compiler are linked together, each containing the three sections .text, data, and .bss, the 
output object file will also contain three sections, .text, data, and .bss. 


7. Relocation Information 


Object files have one relocation entry for each relocatable reference in the text or data. The 
relocation information consists of entries with the following 10-byte format: 
TABLE 5. RELOCATION SECTION CONTENTS 
0-3 long int r_vaddr (Virtual) address of reference sa 
4-7 long int rsymndx Symbol table index 
8-9 unsigned short _r_typ Relocation Type (see below) 


The first 4 bytes of the entry is the virtual address of the text or data to which this entry 
applies. The next field is the index, counted from 0, of the symbol table entry that is being 
referenced. The type field indicates what type of relocation is to be applied. 


7.1 C Data Structure 


The following structure declaration is currently used for relocation entries: 


struct reloc { 
long = r_vaddr; /* virtual address of reference */ 
long = r_symndx; /* index into symbol table */ 
unsigned short r_type; [* relocation type */ 


, 


As the link editor reads each input section and performs relocation, the relocation entries are 
read. They direct how references found within the input section are treated. 


Relocation types currently recognized are: 


VAX RELOCATION TYPES 

Meaning 
Reference is absolute; no relocation is necessary. The 
entry will be ignored. 
Direct &-bit reference to the symbol's virtual address. 
Direct 16-bit reference to the symbol’s virtual address. 
Direct 32-bit reference to the symbol’s virtual address. 


A "PC-relative" 8-bit reference to the symbol’s virtual 
address. 
A "PC-relative" 16-bit reference to the symbo!l’s virtual 
address. 
A “PC-relative" 32-bit reference to the symbol’s virtual 
address. 


On the VAX processors relocation of a symbol index of -1 indicates that the relative 
difference between the current segment’s start address and the program’s load address is 
added to the relocatable address. 


The UNIX system as assembler automatically generates relocation entries which are then 
utilized by the link editor. The link editor uses this information to resolve external 
references in the file. 


8. Line Numbers 


When invoked with the proper option the UNIX system cc compiler generates an entry in 
the object file for every C language source line where a breakpoint can be inserted. Users 
can then reference line numbers when using a software debugger like sdb. All line numbers 
in a section are grouped by function, as shown below. 


symbol index 0 
physical address line number 
physical address line number 

symbol index 0 
physical address line number 


physical address line number 


The first entry in a function grouping has line number zero, and has an index into the 
symbol table for the entry containing the function name in place of the physical address. 
Subsequent entries will have actual line numbers and addresses of the text corresponding to 
the line numbers. The line number entries appear in increasing order of address. 


8.1 C Data Structure 


‘ The following structure declaration is currently used for line number entries: 


struct lineno 
union 
{ 
long |_symndx; /* symtbl index of func name */ 
long I_paddr; /* paddr of line number °/ 
} Laddr; 
unsigned short | Inno; /* line number */ 


};: 
9. Symbol Table 


Because of symbolic debugging requirements for SGS the order of symbols in the symbol 
table is very important. Symbols appear in the sequence shown as below: 


local symbols 
for function 1 


local symbols 
for function 2 


local symbols 
for function 1 


defined global 
mbols 


undefined global 
symbols 


Figure 2. COFF Symbol Table 


The word “statics” in Figure 2 means symbols defined in the C language storage class static 
outside any function. The symbol table consists of at least one fixed-length entry per 
symbol, with some symbols followed by auxiliary entries of the same size. The entry for 
each symbol is a structure that holds the name (null-padded), the structure value, and other 
information. 


9.1 Special Symbols 


The symbol table contains some special symbols that are generated by the compiler, 
assembler and link editor. These symbols are: 
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TABLE 6. SPECIAL SYMBOLS IN THE SYMBOL TABLE 


Symbol Meaning 


file file name 

text address of .text section 

data address of .data section 

ebas address of .bss section 

ebb address of start of inner block 
ab address of end of inner block 
bf address of start of function 
af address of end of function 


target pointer to the function returned structure or union 
See “Symbol Table Entries," Section 1.8.2. 

-xfake dummy tag name for structure, union, or enumeration 

208 end of members of structure, union, or enumeration 


Six of these special symbols occur in pairs. The -bb and .eb symbols indicate the boundaries 
of inner blocks; a .bf and .ef pair brackets each function; and a .xfake and .eos pair names 
and defines the limit of structures, unions, and enumerations that were not named. The .eos 
symbol also appears after named structures, unions, and enumerations. 


When a structure, union, or enumeration has no tag name (a perfectly legitimate C language 
syntax) the SGS must invent a name to be used in the symbol table. The name chosen for 
the symbol table is .xfake, where “x" is an integer. If there are 3 unnamed structures, unions, 
or enumerations in the source, their tag names will be ".O0fake”, “.1fake", and ".2fake’. 


Each of the special symbols has different information stored in the symbol table entry as well 
as the auxiliary entry. (See Sections 1.8.2 and 1.8.3.) 


Inner Blocks 


The C language defines a block as a compound statement that begins and ends with braces ( { 
and } ). An inner block is a block that occurs within statements such as if, switch, or while. 


For each inner block that has local symbols defined, a special symbol .bb is put in the symbol 
table immediately before the first local symbol of that block. Also a special symbol .eb is put 
in the symbol table immediately after the last local symbol of that block. The sequence is: 


local symbols 
for that block 


Because inner blocks can be nested by several levels, nested inner blocks may have the 
following sequence: 


Fe 


for block 1 
for block 2 


-bb for block m 


for block m 

for block m1 
eS > eee 

for block mn 
Reeneaees,. = Bi 


in ae a a oe | 
.eb for block 1 


Symbols for Functions 


For each function, a special symbol .bf is put between the function name and the first local 
symbol of the function in the symbol table. Also a special symbol .ef is put immediately 
after the last loca] symbol of the function in the symbol table. The sequence is: 


If the return value of the function is a structure or union, a special symbol .target is put 
between the function name and the .bf. The sequence becomes: 


The UNIX system cc compiler invents .target to store the function-returned structure or 
union. The symbol .target is an automatic variable with “pointer” type. Its value field in the 
symbol table entry is always zero. 
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9.2 Symbol Table Entries 


All symbols, regardless of storage class and type, have the same format for their entries in & 
the symbol table. The symbol table entries each contain the following 18 bytes of 
information. It should be noted that indices for symbol table entries begin at zero and count 


upward. 


TABLE 7. SYMBOL TABLE ENTRY FORMAT 


Bytes Declaration Name Description 
0-7 char n_name 8 char null-padded symbo] name 
811 long int n_value Symbol value; storage class dependent 


12-13 short n_scnum Section number of symbol 
14-15 unsigned short n_type Basic and derived type specification 


16 char n_sclass storage class of symbo] 
17 char m_numaux number of auxiliary entries 


The meaning of the entries in each of these fields is described below. 


Symbol] Names 


The name of a symbol is currently limited to 8 characters; longer names are truncated by the 
ec compiler. Some special symbols are generated by the compiler and link editor, as 
discussed under Special Symbols. The names of special symbols always start with a dot (*."), 
e.g., file, 5fake, .bb. 


Storage Classes 


The storage class field has one of the following values: 


Mnemonic Value Storage Class 


C_ AUTO 1 automatic variable 
C_EXT 2 external symbol 
C_STAT 3 static 

C_ REG 4 register variable 

C_ LABEL 6 label 

C MOS 8 member of structure 
C_ARG 9 function argument 
C STRTAG 10 structure tag 

C MOU 1) member of union 
C_UNTAG 12 union tag 

C_TPDEF 13 type definition 
C_ENTAG 15 enumeration tag 

C MOE 16 member of enumeration 
C_REGPARM 17 register parameter 
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Mnemonic Value Storage Class 


C_FIELD 18 bit field 

C BLOCK 100 beginning and end of block 
C_FCN 101 beginning and end of function 
C EOS 102 end of structure 

C FILE 103 file name 

C_ALIAS 105 duplicated tag 


The SGS compress utility, cprs, generates the C ALIAS mnemonic. This utility, which is 
described in the UNIX Reference manual, removes duplicated structure, union, and 
enumeration definitions and puts ALIAS entries in their places. 


There are also some “dummy” storage classes defined in the header file. They are used only 
internally by the compiler and the assembler. These storage classes are: 


Mnemonic Value Storage Class 


C_EFCN -1 physical end of function 

C EXTDEF 5 external definition 
C_ULABEL 7 undefined label 

C_USTATIC 14 uninitialized static 

C_LINE 104 used only by utility programs 


Storage Classes for Special Symbols 


Some special symbols are restricted to certain storage classes. They are: 
Special Symbol Storage Class 


file C_FILE 
-bb C_ BLOCK 
eb C_ BLOCK 
-bf C_FCN 
ef C FCN 
target C_AUTO 
xfake C_STRTAG, C_UNTAG, C_ENTAG 
206 C EOS 
text C_STAT 
data C STAT 
-bss C STAT 


Also some storage classes are used only for certain special symbols. They are summarized as 
follows: 


Storage Class Special Symbol 


C_BLOCK -bb, .eb 
C_ FCN -bf, .ef . 
C EOS 208 


C_FILE file 


wes 


Symbol Value Field 


The meaning of the ‘value* of a symbol depends on its storage class. This relationship is 
summarized as follows: 


Storage Class Meaning 


C_AUTO stack offset 

C_EXT relocatable address 
C STAT relocatable address 
C_REG register number 
C_LABEL relocatable address 
C_ MOS offset 

C_ ARG stack offset 

C STRTAG 0 

C_MOU offset 

C_UNTAG 0 

C_TPDEF 0 

C_ENTAG 0 

C_MOE enumeration value 
C_REGPARM _ register number 
C_FIELD bit displacement 
C_BLOCK relocatable address 
C_FCN relocatable address 
C_EOS size 

C_FILE (see below) 
C_ALIAS tag index 


If the current symbol is the last symbol in this object file that has storage class C_FILE (-file 
symbol), its value is the symbol table entry index of the first global symbol. Otherwise the 
symbol value equals the symbol table entry index of the next file symbol. That is, the .file 
entries form a one-way linked list in the symbol table. 


Relocatable symbols have a value equal to the virtual address of that symbol. When the 
section is relocated by the link editor, the value of these symbols changes. 


Section Number Field 


Section numbers have the following meaning: 


Mnemonic Section Number Meaning 

N_DEBUG 2 special symbolic debugging symbol 
N_ABS -1 absolute symbol 

N_UNDEF 0 undefined external symbol 

N_SCNUM 1-077767 section number where symbol was defined 


A special section number (-2) marks symbolic debugging symbols, including 
structure/union/enumeration tag names, typedefs, and the name of the file. A section 
number of -1 indicates that the symbol has a value, but is not relocatable. Examples of 
absolute-valued symbols include automatic and register variables, function arguments, and 
eos symbols. A section number of zero indicates a relocatable external symbol that is not 


a 


defined in the current file. The .text, data, and .bss symbols default to section numbers 1, 2, 


and 3 respectively. : 


Section Numbers and Storage Classes 


Symbols having certain storage classes are also restricted to certain section numbers, They 


are summarized as follows: 


Storage Class 


Type Entry 


The type field in the symbol table entry contains info 
type for the symbol. Each symbol has exactly one basic, 


Section Number 


N_ABS 
N_ABS, N_UNDEF, N_SCNUM 


rmation about the basic and derived 
or fundamental, type, but can have 


more than one derived type. The formtat of the 16-bit type entry is: 


Bits 0-3, called "typ", indicate one of the following fundamental types: 


Mnemonic 


Value 


Type 


type not assigned 
character 

short integer 
integer 

long integer 
floating point 


awh WN OS 


~fé« 


T_DOUBLE 7 double word 
T_STRUCT 8 structure 

T_UNION 9 union 

T_ENUM 10 enumeration 

T_MOE 11 member of enumeration 
T_UCHAR 12 unsigned character 
T_USHORT 13 unsigned short 
T_UINT 14 unsigned integer 
T_ULONG 15 unsigned long 


Bits 4-15 are arranged as six 2-bit fields marked "di" through "d6."° These "d" fields represent 
levels of the following derived types: 


Mnemonic Value Type 
DT_NON 0 no derived type 
DT_PTR 1 pointer 
DT_FCN 2 function 
DT_ARY 3 array 


The following examples demonstrate the interpretation of the symbol table entry 
representing type. 

char °func(); 
Here func is the name of a function that returns a pointer to a character. The fundamental 
type of func is 2 (character), the dl field is 2 (function), and the d2 field is 1 (pointer). 


Therefore the type word in the symbol table for func would contain the hexadecimal number 
0x62, which is interpreted to mean “function that returns a pointer to a character.” 


short *tabptr[10][25]13]; 


Here tabptr is a three dimensional array of pointers to short integers. The fundamental type 
of tabptr is 3 (short integer); the d1, d2, and d3 fields each contains a 3 (array), and the d4 
field is 1 (pointer). Therefore the type entry in the symbol table would contain the 
hexadecimal number 0x7f3, indicating a "three dimensional array of pointers to short 
integers.” 


Type Entries and Storage Classes 


The following table shows which type entries are legal for each storage class. 


nf 
> 


An 
ae 


; 


n 
x 
& 


‘e) 
if 
9] 


C_ALIAS 


TABLE 8. TYPE ENTRIES BY STORAGE CLASS 


Function? 


no 


no 


=~$%'s 


*d° entry----———— 
Array? Pointer? 
yes yes 
yes 7 
yes yes 
no yes 
no no 
yes yes 
nO yes 
no no 
yes yes 
no no 
yes yes 
no no 
no no 
no yes 
no no 
no no 
no no 
no no 
no no 
no no 


“typ” entry 
basic type 
Note 1 


Note 1 
Note 1 


Note 1: any except T.MOE Note 2: T_ENUM, T_UCHAR, T_USHORT, T_UINT, T.ULONG 
Note 3: T_STRUCT, T_UNION, TENUM 


Conditions for the "d" entries apply to dl through d6, except that it is impossible to have two 
consecutive derived types of “function.” 


Although function arguments can be declared as arrays, they will be changed to pointers by 


default. Therefore no function argument can have “array” as its derived type. 


Structure for Symbol Table Entries 


The following C language structure declaration is currently used for symbol table entries: 


} 
; 


#define SYMNMLEN 


#define SYMESZ 


n_name[SYMNMLEN]; 


n_value; 
n_scnum,; 
n_type; 
n_sclass; 
n_numaux,; 
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9.3 Auxiliary Table Entries 


/* symbol name °*/ 
/* value of symbol °*/ 
/* section number */ 


/* type and derived type */ 


/* storage class ° / 
/* number of aux entries °/ 


/* size of a symbol table entry */ 
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Currently, there is at most one auxiliary entry per symbol. The auxiliary table entry contains 
the same number of bytes as the symbol! table entry. However, unlike symbol table entries, 


the format of an auxiliary table entry of a symbol depends on its type and storage class. 
They are summarized as follows: 


TABLE 9. AUXILIARY SYMBOL TABLE ENTRIES 


Storage Type Entry Auxiliary 


sg Class di typ Entry Format 
file C_FILE DT_NON T_NULL file name 
-text,.data,.bss C STAT DT_NON T_NULL section 
tagname C_STRTAG, DT_NON T_NULL tag name 
C_UNTAG,C_ENTAG 
208 C_EOS DT_NON T_NULL end of 
structure 
fcname C_EXT,C_STAT DT_FCN Note 1 function 
arrname Note 2 DT_ARY Note 1 array 
ebb C BLOCK DT_NON T_NULL beginning of 
block 
ab C_BLOCK DT_NON T_NULL end of block 
bf,.ef C FNC DT_NON T_NULL beginning and 
end of function 
name related to Note 2 DT_PTR, T_STRUCT, name related to 
structure, DT_ARR, T_UNION, structure, 
union, DT_NON T_ENUM union, 
enumeration enumeration 


Note 1: Any except T_MOE. 


Note 2: C AUTO, C_STAT, C_MOS, C_MOU, C_TPDEF. 


In the above table “tagname” means any symbol name including the special symbol .xfake, 
and “fcname" and “arrname" represent any symbol name. 


Any symbol that satisfies more than one condition in Table 7 should have a union format in 
its auxiliary entry. Symbols that do not satisfy any of the above conditions should NOT have 
any auxiliary entry. 


File Names 


Each of the auxiliary table entries for a file name contains a 14-character file name in bytes 0 
through 13. The remaining bytes are zero, regardless of the size of the entry. 


Sections 


The auxiliary table entries for sections have the format: 


Tag Names 


= 


Bytes Declaration Name Description 

0-3 long int x_scnien section length 

4-5 unsigned short x_nreloc number of relocation entries 
6-7 unsigned short x_nlinno number of line numbers 
8-17 - dummy 0 


The auxiliary table entries for tag names have the format: 


Bytes Declaration 
0-5 - 

6-7 unsigned short 
8-11 - 

12-15 Jong int 

16-17. - 


End of Structures 


Name 
dummy 


x_size 


dummy 
x_endndx 


Description 

0 

size of _ struct, 
union or 
enumeration 

0 

index of next 


entry beyond this 
structure, union, 
or enumeration 

0 


The auxiliary table entries for the end of structures have the format: 


Functions 


Bytes Declaration 
0-3 long int 
45 - 


6-7 unsigned short 


Name 
x_tagndx 


dummy 
x_size 


dummy 


Description 


tag index 

0 

size of struct, 
union or 
enumeration 

0 


The auxiliary table entries for functions have the format: 


Bytes Declaration Name Description 

0-3 long int x_tagndx tag index 

a7 long int x_fsize size of function (in bytes) 

811 long int x_Innoptr file pointer to line number 

12-15 long int x_endndx index of next entry 
beyond this function 

16-17. unsigned short x_tvndx not used in UNIX system 


Arrays 


The auxiliary table entries for arrays have the format: 


Bytes Declaration Name Description 

0-3 long int x_tagndx tag index 

45 unsigned short x_inno line number of declaration 
6-7 unsigned short x_size size of the array 

8-9 unsigned short x_dimenf{0] first dimension 

10-11 unsigned short x_dimen{1] second dimension 

12-13. unsigned short x_dimen[2] third dimension 

1415 unsigned short x_dimen{3] fourth dimension 

16-17. - dummy 0 


End of Blocks and Beginning and End of Functions 


The auxiliary table entries for the end of blocks and the beginning and end of functions 


have the format: 


Bytes Declaration 
0-3 - 
45 unsigned short 
6-17 - 

Beginning of Blocks 


Name Description 

dummy 0 

xinno C-source line number 
dummy 0 


The auxiliary table entries for the beginning of blocks have the format: 


Bytes Declaration 


0-3 - 
45 unsigned short 
6-11 


Name Description 
dummy 0 
x_Inno C-source line number 


dummy 0 


-21- 


12-15 long int x_endndx index of next entry 
past this block 
16-17. - dummy e 


Names related to Structures, Unions, and Enumerations 


The auxiliary table entries for structure, union, and enumerations symbols have the format: 
Bytes Declaration Name Description 


0-3 long int x_tagndx tag index 

4-5 “ dummy 0 

6-7 unsigned short x_size size of the 
structure, wnion, 
or enumeration 

8-17 - dummy 0 


Names defined by “typedef" may or may not have auxiliary table entries. For example, 
typedef struct people STUDENT; 
struct people { 
char name{20]; 
long id; 


’ 


typedef struct people EMPLOYEE; 


The symbol “EMPLOYEE” will have an auxiliary table entry in the symbol! table but symbol 
‘STUDENT’ will not. 


C Data Structure 


The exact C language data structure for auxiliary symbol table entries can be found on the 
UNIX Reference manual page for SYMS(4). 


10. Access Routines 


Supplied with every standard UNIX system release are set of access routines that are used for 
reading the various parts of a Common Object File. Although the calling program must 
know the detailed structure of the parts of the object file it processes, the routines effectively 
insulate the calling program from the knowledge of the overall structure of the object file. 
In this way, users can concern themselves with the section they are interested in without 


needed to know all the object file details. 


The access routines can be divided into four categories: 
1. Functions that open or close an object file 
2. Functions that read header or symbol table information 


3. Functions that position an object file at the start of a particular section of the object file 
4. A function that returns the symbol table index for a particular symbol 


These routines can be found in /lib/libld.a and are listed in section 3 of the UNIX system 
Reference manual. A summary of what is available can be found in the UNIX system 
Reference manual under LDFCN(4). 


